Multiwords and Word Sense Disambiguation
نویسندگان
چکیده
This paper studies the impact of multiword expressions on Word Sense Disambiguation (WSD). Several identification strategies of the multiwords in WordNet2.0 are tested in a real Senseval-3 task: the disambiguation of WordNet glosses. Although we have focused on Word Sense Disambiguation, the same techniques could be applied in more complex tasks, such as Information Retrieval or Question Answering.
منابع مشابه
Senseval-3: The Italian all-words task
This paper describes the Italian all-words sense disambiguation task for Senseval-3. The annotation procedure and criteria together with the encoding of multiwords are presented.
متن کاملUsing LazyBoosting for Word Sense Disambiguation
This paper describes the architecture and results of the TALP system presented at the SENSEVAL-2 exercise for the English lexical–sample task. This system is based on the LazyBoosting algorithm for Word Sense Disambiguation (Escudero et al., 2000), and incorporates some improvements and adaptations to this task. The evaluation reported here includes an analysis of the contribution of each compo...
متن کاملرفع ابهام معنایی واژگان مبهم فارسی با مدل موضوعی LDA
Word sense disambiguation is the task of identifying the correct sense for the word in a given context among a finite set of possible sense. In this paper a model for farsi word sense disambiguation is presented. The model use two group of features: first, all word and stop words around target word and topic models as second features. We extract topics from a farsi corpus with Latent Dirichlet ...
متن کاملMaking Hidden Semantics of Hierarchical Classifications Explicit
Concept hierarchies are semi-structured knowledge repositories used for organizing large amounts of documents. File systems, products taxonomies for the market place and the directories provided by Web portals are common examples of concept hierarchies. We take the perspective in which such knowledge sources are inherently distributed and we address the problem of allowing their interoperabilit...
متن کاملMultilingual Wordnet sense Ranking using nearest context
In this paper, we combine methods to estimate sense rankings from raw text with recent work on word embeddings to provide sense ranking estimates for the entries in the Open Multilingual Wordnet (OMW).The existing Word2Vec Polyglot2 pre-trained models are only built for single word entries, we, therefore, re-train them with multiword expressions from the wordnets, so that multiword expressions ...
متن کامل